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^ • Abstract 

i-^ , We discuss the hierarchy of subphase transitions in first-order-hke nucleation 

O ' processes for an examplified aggregation transition of heteropolymers. We 
perform an analysis of the microcanonical entropy, i.e., the density of states is 

^ ■ considered as the central statistical system quantity since it connects system- 

CS| . specific entropic and energetic information in a natural and unique way. 

m 
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1. Introduction 



H ' The noncovalent cooperative effects in structure formation processes on 

mesoscopic scales let linear polymers be very interesting objects for studies of 
the statistical mechanics and thermodynamics of nucleation processes, even 
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on a fundamental level. Structural properties of polymers can typically be 
well described by means of simple, coarse-grained models of beads and sticks 
(or springs) representing the monomers and the covalent bonds between ad- 
jacent monomers in the chain, respectively. Contemporary, sophisticated 
generalized-ensemble Monte Carlo simulation techniques as well as large- 
scale computational resources enable the precise and systematic analysis of 
thermodynamic properties of all structural phases of coarse-grained polymer 
models by means of computer simulations. Among the most efficient sim- 



ulations_methods are multicanonical samplin 
niques 



, |2|, replica-exchange tech- 



a. 



, y], and the Wang-Landau method 
The high precision of the numerical data for quantities that are hardly 
accessible in analytic calculations - one of the most prominent and, as it will 
turn out in the following, most relevant system-specific quantities is the den- 
sity (or number) of states with energy E, g{E) - opens new perspectives for 
the physical interpretation and classification of cooperative processes such as 
phase transitions. This is particularly interesting for small systems, where 
conventional statistical analyses are often little systematic and a general con- 
cept seems to be missing. This is apparently refiected in conformational stud- 
ies in the biosciences, where often novel terminologies are invented for basi- 
cally the same classes of transitions. The introduction of a unifying scheme 
appears insofar difficult as finite-size effects in transitions of small systems 
influence the thermal fluctuations of transition indicators like order param- 
eters. Maximum energetic fluctuations, represented by peaks in the speciflc 
heat, do not necessarily coincide with peak temperatures of fluctuations of 
structural quantities such as the radius of gyration [y, l7|. For extremely large 



systems, which are well described by the theoretical concept of the "thermo- 
dynamic limit" , this canonical approach is appealing as the fluctuation peaks 
scale with system size and finally collapse at the same temperature, allowing 
for the definition of a unique transition point. However, for many systems, in 
particular biomolecules, the assumption of the thermodynamic limit is non- 
sensical and the explicit consideration and understanding of finite-size effects 
is relevant. 

2. Microcanonical vs. canonical temperature 

The microcanonical analysis |8| allows for such an in-depth analysis of 
smallness effects. It is completely based on the entropy as a function of the 
system energy, which is related to the density of states via S{E) = /cb In g{E), 
where k-^ is the Boltzmann constant. A major advantage of this quantity is 
the possibility to introduce the temperature as a derived quantity via 

'dSiE) 



T{E) 



(1) 



dE 

which is commonly refered to as the "microcanonical temperature". This 
terminology is misleading since S{E) and thus T{E) do not depend on the 
choice of any statistical ensemble associated to a certain thermal environment 
of the system. Thus, the physical meaning of S{E) and T{E) is not restricted 
to systems well-described by the microcanonical ensemble only (i.e., for sys- 
tems with constant energy). The introduction of the temperature via Eq. ([T]) 
is also useful for another reason. It applies independently of the system size 
and it is not coupled to any equilibrium condition. To make use of theoretical 
concepts like the thermodynamic limit or quasiadiabatic process flow is not 
necessary. 



The typically used heatbath concept for introducing the temperature in 
the context of the thermodynamic equilibrium of heatbath and system is 
useful for large systems when thermal fluctuations become less relevant and 
finite-size effects disappear. However, as it has already been known from 
the early days of statistical mechanics, the statistical ensembles turn over to 
the microcanonical ensemble in the thermodynamic limit. This is easily seen 
for the example of the fluctuations about the mean energy in the canonical 
ensemble. In this case, the canonical statistical partition function, linked 
to the free energy F(T^y^^^), can be written as an integral over the energy 
space, 

^(^system) = / dEg{E)e-''/^^'^^'y^^^- = e^/'=«^='y--, (2) 

where the canonical system temperature T^y^^^ corresponds in equilibrium to 
the heatbath temperature Thoatbath (which is an adjustable external thermal 
control parameter in experiments): 

-^system — -'heatbath- {'J J 

Therefore, in equilibrium, the mean energy at a given heatbath temperature 
can simply be calculated as: 

(i5)(Theatbath) = -^77f^ T f dE E g{E)e-^'^^''^-^-^^\ (4) 

^l J heatbath J J 

The heat capacity Cy(Thcatbath) = (i(E)/(iThcatbath = {{E'^)-{E)'^)lkBTl^.^^^^th 
must always be nonnegative because of the thermodynamic stability of mat- 
ter, i.e., {E) increases monotonously with Thcatbath- For this reason, the de- 
pendence of {E) on the temperature can trivially be inverted, Tl^ystem((-^)); 
where we have made use of the equilibrium condition (|3]). In complete analogy 



to the microcanonical definition of tlie temperature in Eq. ([T]), we introduce 
tlie canonical entropy via tlie relation 

-^ system I \-^/ J 



dS'^^'^iE)) 



(5) 

Ny 



d{E) 

where particle number A^ and volume V are kept constant. The canonical 
entropy can explicitly be expressed by the celebrated equation that links 
thermodynamics and statistical mechanics: 

S'^'\{E)) = ^..„ \,r,.. m - F(T-^en.((^)))] • (6) 

systcmV \ / y 

Instead of considering the canonical ensemble by fixing Theatbath, V, and A^ 
as external parameters, we have turned to the caloric representation, where 
(E), V, and A^ are treated as independent variables. If the fluctuations 
of energy about the mean value (E) vanish, the canonical ensemble thus 
corresponds to the microcanonical ensemble. This is obvious in the thermo- 
dynamic limit, where the relative width of the canonical energy distribution. 



AE/{E) = ^WV^WKE) = v/^^BT4,tbathCl/(Theatbath)/(i?) « l/v^ 

vanishes, liin.N^ooi'AE / (E)) = 0, since Cy and (E) are extensive variables 
as they scale with A^. Thus, (E) = E and T{E) = T^^^^^^^{{E)) in the 
thermodynamic limit. 

3. Small systems 

Microcanonical and canonical temperatures do typically not coincide, if 
finite-size effects matter. This is particularly apparent under conditions 
where cooperative changes of macrostates, such as conformational transitions 
of finite molecular systems, occur. In transitions with structure formation. 



the conformational entropies associated to volume and surface effects in the 
formation of compact states of the system compete with the energetic dif- 
ferences of particles located at the surface or in the interior of the structure. 
Examples for such morphologies of finite size are atomic clusters, spin clus- 
ters, the interface of demixed fluids, globular polymers or proteins, crystals, 
etc. Since many interesting systems such as heterogeneous biomolecules like 
proteins are "small" in a sense that a thermodynamic limit does not exist at 
all, it is useful to build up the analysis of transitions of small systems on the 
most general grounds. These are, as we have argued above, best prepared by 
the microcanonical or caloric approach. 

The folding of a protein is an example for a subtle structure formation 
process of a finite system, where effects on nanoscopic scales (e.g., hydro- 
gen bonds being responsible for local secondary structures such as helices 
and sheets) and cooperative behavior on mesoscopic scales (such as the less 
well-defined hydrophobic effect which primarily drives the formation of the 
global, tertiary structures) contribute to the stable assembly of the native 
fold which is connected to the biological function of the protein. Proteins are 
linear polypeptides, i.e., they are composed of amino acids linked by peptide 
bonds. There are twenty types of amino acids that have been identified in 
bioproteins, all of them differ in their chemical composition and thus possess 
substantial differences in their physical properties and chemical reactivity. 
The mechanism of folding depends on the sequence of amino acids, not only 
the content, i.e., a protein is a disordered system. It is one of the central 
questions, which mutations of a given amino-acid sequence can lead to a rel- 
evant change of morphology and thus the loss of functionality. The atomistic 



interaction types, scales, and ranges are different as well. For this reason, 
in compact folds, frustration effects may occur. Disorder and frustration 
cause glassy behavior, but how glassy is a single protein and can this be 
generalized? 

Since details seem to be relevant for folding, it has commonly been be- 
lieved that the folding of a certain protein is a highly individual process. 
Thus, it was a rather surprising discovery that the search for a stable fold 
can be a cooperative one-step process which can qualitatively and quantita- 
tively be understood by means of the statistical analysis of a single effective, 
mesoscopic order parameter. The free-energy landscape turned out to be 
very simple (exhibiting only a single barrier between folded and unfolded 
conformations) for this class of "two-state folders" [9|, |lO| • Of course, folding 



pathways 
can occur 



or other proteins can be more complex as also intermediate states 
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12| . However, this raises the question about cooperativity and 
the generalization of folding behavior in terms of conformational transitions 
similar to phase transitions in other fields of statistical mechanics. In the fol- 
lowing, we are going to discuss a structure formation process, the aggregation 
of a finite system of heteropolymers, by a general microcanonical approach in 
order to show how the conformational transition behavior of a small system 
is indeed related to thermodynamic phase transitions. 

4. Exemplified nucleation process: Aggregation of proteins 

As an example for the occurrence of hierarchies of subphase transitions 
accompanying a nucleation process, we are going to discuss molecular ag- 



gregation |l3 



- |l5| by means of a simple coarse-grained hydrophobic-polar 



heteropolymer model [16|, Il7[ • In this so-called AB model, only hydropho- 
bic (A) and hydrophilic (B) monomers line up in linear heteropolymer se- 
quences. In the following, we consider the aggregation of four chains with 
13 monomer s | l4l| . All chains have the same Fibonacci sequence AB2AB2- 
ABAB2AB [16|. Folding and a ggr egation of this heteropolymer have already 



been subject of former studies 
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14. 



In the model used here, bonds between adjacent monomers have fixed 
length unity. Nonbonded monomers of individual chains but likewise mono- 
mers of different chains interact pairwisely via Lennard- Jones-like potentials. 
The explicit form of the potentials depends on the types of the interacting 
monomers. Pairs of hydrophobic and unlike monomers attract, pairs of polar 
monomers repulse each other. This effectively accounts for the hydrophobic- 
core formation of proteins in polar solvent. Details of our aggregation model 
and of the implementation of the multicanonical Monte Carlo simulation 

n 

method are described in Ref. |14| . 

The multicanonical computer simulations enabled us to obtain a pre- 
cise estimate for the microcanonical entropy S{E) of the multiple-chain 
system [ij], as shown in Fig. [1] as a function of the energy per monomer, 
e = E/N. The entropy curve is convex in the energetic aggregation transi- 
tion region as expected for a first-order-like nucleation transition of a finite 
system |8|. The Gibbs tangent 'H(e), connecting the two coexistence points 
where concave and convex behavior change, provides the least possible over- 
all concave shape of ^(e) in this region. The difference between the Gibbs 
hull and the entropy curve, AS{e) = 'H(e) — S{e), is also shown in Fig. [1] 
Not only the entropic suppression in the transition region is clearly visible. 
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Figure 1: Microcanonical entropy S{e), the Gibbs hull 'H(e), and the deviation A5(e) = 
'H(e) — '5'(e) as functions of the energy per monomer. 

it is also apparent that the transition possesses an internal structure. 

In order to better understand the subphases, we discuss in the following 
the inverse derivative of the entropy, the microcanonical temperature ([I]), 
T(e), which is plotted in Fig. O The slope of the Gibbs tangent corresponds 
to the Maxwell line in Fig. [2] at the aggregation temperature Tagg ~ 0.217. 
Therefore, the intersection points of the Maxwell line and the temperature 
curve define the energetic phase boundaries Cagg and e^ag, respectively, as 
both phases coexist at Tagg. 

For energies e < eagg ~ —0.43, conformations of a single aggregate, com- 
posed of all four chains, dominate. On the other hand, conformations with 
e > Cfrag ~ 0.05 are mainly entirely fragmented, i.e., all chains can form in- 
dividual conformations, almost independently of each other. The entropy is 
governed by the contributions of the individual translational entropies of the 
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Figure 2: Microcanonical temperature T'(e) as a function of energy per monomer, e = 
E/N, where N is the total number of all monomers in the system. The horizontal Maxwell 
line marks the aggregation temperature Tagg, obtained by the Gibbs construction. Vertical 
dashed lines separate the different phases and subphases, respectively. 

chains, outperforming the conformational entropies. The translational en- 
tropies are only limited by the volume which corresponds to the simulation 
box size. The energetic difference Ag = Cfrag — Cagg, serving as an estimator 
for the latent heat, is obviously larger than zero, Ag ^ 0.48. It thus cor- 
responds to the total energy necessary to entirely melt the aggregate into 
fragments at the aggregation (or melting) temperature. 

The relation between energy and microcanonical temperature in the ag- 
gregate phase and in the fragment phase is as intuitively expected: With 
increasing energy, also T(e) increases. However, much more interesting is 
the behavior of the system in the energy interval eagg < e < e^ag, which 
represents the energetic nucleation transition region. Figure H] clearly shows 
that in our example T(e) changes the monotonic behavior three times in this 
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Figure 3: Representative conformations in the different structural phases. 
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regime. Representative conformations in the diffferent structural phases are 
shown in Fig. |3l The change of monotony of the microcanonical temperature 
curve is called backbending effect, because the temperature decreases while 
energy is increased. This rather little intuitive behavior is a consequence of 
the suppression of entropy in this regime {S{E) is convex in the backbending 
region) . The surface entropy per monomer vanishes in the thermodynamic 



limit 



l|. 



If two chains aggregate (subphase 1 in Fig. [2]), the total translational en- 
tropy of the individual chains is reduced by /c^ln V, where V is the volume 
(corresponding to the simulation box size), whereas the energy of the aggre- 
gate is much smaller than the total energy of the system with the individual 
chains separated. Thus, the energy associated to the interaction between dif- 
ferent chains, i.e., the cooperative formation of inter-chain contacts between 
monomers of different chains, is highly relevant here. This causes the latent 
heat between the completely fragmented and the two-chain aggregate phase 
to be nonzero which signals a first-order-like transition. This procedure con- 
tinues when an additional chain joins a two-chain cluster. Energetically, the 
system enters subphase 2. Qualitatively, the energetic and entropic reasons 
for the transition into this subphase are the same as explained before, since 
it is the same kind of nucleation process. In our example of four chains in- 
teracting with each other, there is another subphase 3 which also shows the 
described behavior. The energetic width of each of the subphase transitions 
corresponds to the respective latent heat gained by adding another chain to 
the already formed cluster. The subphase boundaries (vertical dashed, gray 
lines in Fig. [2]) have been defined to be the inflection points in the raising 
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temperature branches, thus enclosing a complete "oscillation" of the tem- 
perature as a function of energy. The energetic subphase transition points 
are located at eu ~ —0.11 and 623 ~ —0.26, respectively. Therefore, the la- 
tent heat associated to these subphase transitions is in all three cases about 
Aqij ^ 0.16 {i,j = 1,2,3, i ^ j), thus being one third of the total latent 
heat of the complete nucleation process. This reflects the high systematics 
of subphase transitions in flrst-order nucleation processes. 

5. Summary 

The most interesting result from this heteropolymer aggregation study 
is that flrst-order phase transitions such as nucleation processes can be un- 
derstood as a composite of hierarchical subphase transitions, each of which 
exhibits features of flrst-order-like transitions. Since with increasing num- 
ber of chains the microcanonical entropy per chain converges to the Gibbs 
hull in the transition region, the "amplitudes" of the backbending oscilla- 
tions and the individual latent heats of the subphases become smaller and 
smaller [ij, [l5[. Thus, in the thermodynamic limit, the heteropolymer ag- 
gregation transition is a flrst-order nucleation process composed of an inflnite 
number of inflnitesimally "weak" flrst-order-like subphase transitions. 
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